Dependency-Based PropBanking of Clinical Finnish

نویسندگان

  • Katri Haverinen
  • Filip Ginter
  • Timo Viljanen
  • Veronika Laippala
  • Tapio Salakoski
چکیده

In this paper, we present a PropBank of clinical Finnish, an annotated corpus of verbal propositions and arguments. The clinical PropBank is created on top of a previously existing dependency treebank annotated in the Stanford Dependency (SD) scheme and covers 90% of all verb occurrences in the treebank. We establish that the PropBank scheme is applicable to clinical Finnish as well as compatible with the SD scheme, with an overwhelming proportion of arguments being governed by the verb. This allows argument candidates to be restricted to direct verb dependents, substantially simplifying the PropBank construction. The clinical Finnish PropBank is freely available at the address http://bionlp.utu.fi.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards a Dependency-based PropBank of General Finnish

In this work, we present the first results of a project aiming at a Finnish Proposition Bank, an annotated corpus of semantic roles. The annotation is based on an existing treebank of Finnish, the Turku Dependency Treebank, annotated using the well-known Stanford Dependency scheme. We describe the use of the dependency treebank for PropBanking purposes and show that both annotation layers prese...

متن کامل

Familial Amyloid Polyneuropathy Type IV (FINNISH) with Rapid Clinical Progression in an Iranian Woman: A Case Report

Familial amyloid polyneuropathy (FAP) type IV (FINNISH) is a rare clinical entity with challenging neuropathy and cosmetic deficits. Amyloidosis can affect peripheral sensory, motor, or autonomic nerves. Nerve lesions are induced by deposits of amyloid fibrils and treatment approaches for neuropathy are challenging. Involvement of cranial nerves and atrophy in facial muscles is a real concern i...

متن کامل

Dependency Annotation of Wikipedia: First Steps Towards a Finnish Treebank

In this work, we present the first results obtained during the annotation of a general Finnish treebank in the Stanford Dependency scheme. We find that the scheme is a suitable syntax representation for Finnish, with only minor modifications needed. The treebank is based on text from the Finnish Wikipedia, ensuring its free distribution and broad topical variance. To assess the suitability of W...

متن کامل

Parsing Clinical Finnish: Experiments with Rule-Based and Statistical Dependency Parsers

In this paper, we present a new syntactically annotated corpus consisting of daily notes from an intensive care unit in a Finnish hospital. Using the corpus, we perform experiments with both rule-based and statistical parsers. We apply an existing rule-based parser specifically developed for this clinical language and create a set of conversion rules for transforming the constituency scheme of ...

متن کامل

Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3

Corpus-based treebank annotation is known to result in incomplete coverage of midand low-frequency linguistic constructions: the linguistic representation and corpus annotation quality are sometimes suboptimal. Large descriptive grammars cover also many midand low-frequency constructions. We argue for use of large descriptive grammars and their sample sentences as a basis for specifying higher-...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010